NoSym: Non-Symbolic Databases for Data Decoupling

نویسنده

  • Souleiman Hasan
چکیده

Under the Unique Name Assumption (UNA), users need to have shared agreements on signifiers to use in schema or data, e.g. to use “genre” and not “type” to refer to a movie’s category. Agreements are difficult in open environments such as datasets on the web, open data, and crowd-sourced databases, thus this assumption can be invalid. Schema matching and data integration can be limited in responding to this problem [2] as: (1) schemas might not be available a priori with schema-less data sources and queries becoming more common; (2) dataset-level schema/data mappings limit a user’s ability to provide a contextual interpretation of a signifier suitable for a specific query-data matching task; and (3) data integration typically has an overhead which hinders the availability and low latency of databases.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining

Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...

متن کامل

Beyond the Dichotomy of Symbolic versus Substantive Actions: Evidence from Corporate Environmental Management

The symbolic management literature explores loose coupling between substantive and symbolic aspects of organizational activities. The prior literature, however, focuses on the benefits of symbolic management and tends to treat it as costless. If symbolic management is costless, presumably all firms should pursue it, yet in practice they do not. In this paper, we extend the theory of symbolic ma...

متن کامل

Foundations of Data Mining and knowledge Discovery

This paper discusses a view to capture discovery as a translation from non-symbolic to symbolic representation. First, a relation between symbolic processing and non-symbolic processing is discussed. An intermediate form was introduced to represent both of them in the same framework and clarify the difference of these two. Characteristic of symbolic representation is to eliminate quantitative m...

متن کامل

Testing Database Programs using Relational Symbolic Execution

Symbolic execution is a technique which allows to automatically generate test inputs (and outputs) exercising a set of execution paths within a program to be tested. If the paths cover a sufficient part of the code under test, the test data offer a representative view of the program’s actual behaviour, allowing to detect failures and correct faults. Relational databases are ubiquitous in softwa...

متن کامل

ChromoViz: multimodal visualization of gene expression data onto chromosomes using scalable vector graphics

SUMMARY ChromoViz is an R package for the visualization of microarray gene expression data, cross-species and cross-platform comparisons, as well as non-expression genomic data obtained from public databases onto chromosomes. Chromosomal visualization format is proposed for the clear decoupling of the data layer from the procedure layer and the combined visualization of genomic data from hetero...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017